Evolutionary Trends of the Transposase-Encoding Open Reading Frames A and B (orfA and orfB) of the Mycobacterial IS6110 Insertion Sequence
Abstract
BACKGROUND The IS6110 insertion sequence, a member of the IS3 family of insertion sequences, was found to be specific to the Mycobacterium tuberculosis complex (MTBC). Although IS6110 has been extensively characterized as a transposable genetic marker, the evolutionary history of its own transposase-encoding sequence has not, to the best of our knowledge, been investigated.
METHODOLOGY/PRINCIPAL FINDINGS Here we explored the evolution of the IS6110 sequence by analysing the genetic variability of, and the selective forces acting on, its transposase-encoding open reading frames (ORFs) A and B (orfA and orfB). For this purpose, we used a strain collection consisting of smooth tubercle bacilli (STB), an early branching lineage of the MTBC, and present-day M. tuberculosis strains representing the full breadth of genetic diversity in Tunisia. In each ORF, we found a major haplotype that dominated over a flat distribution of rare descendant haplotypes, consisting mainly of single- and double-nucleotide variant singletons. The predominant haplotypes included both ancestral and present-day strains, suggesting that IS6110 acquisition predated the emergence of the MTBC. There was no evidence of recombination, and both ORFs were subject to strict purifying selection, as demonstrated by their dN/dS ratios (0.29 for orfA and 0.51 for orfB) and their significantly negative Tajima's D statistics. Strikingly, the purifying selection acting on orfA proved much more stringent, suggesting a critical role for orfA in regulating the transpositional process. Maximum likelihood analyses further excluded any possibility of positive selection acting on single amino acid residues.
CONCLUSIONS/SIGNIFICANCE Taken together, our data fit an evolutionary scenario in which the observed variability pattern of the IS6110 transposase-encoding ORFs is generated mainly through random point mutations that accrued on a functionally optimal IS6110 copy, whose acquisition predated the emergence of the MTBC. Background selection acting against deleterious mutations led to an excess of low-frequency variants.
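To illustrate why an excess of low-frequency variants goes together with a negative Tajima's D, the following is a minimal Python sketch of the standard Tajima (1989) statistic. It is not the authors' pipeline: the function name `tajimas_d` and the two toy alignments (`skewed`, `balanced`) are hypothetical and invented purely for illustration, and the sketch assumes gap-free, equal-length aligned sequences.

```python
from itertools import combinations
from math import sqrt


def tajimas_d(seqs):
    """Tajima's D for equal-length aligned sequences (no gap handling)."""
    n = len(seqs)
    if n < 4:
        # The variance term is zero for n = 2 or 3, so D is undefined there.
        raise ValueError("need at least 4 sequences")
    if len({len(s) for s in seqs}) != 1:
        raise ValueError("sequences must be aligned to the same length")

    # S: number of segregating (polymorphic) sites in the alignment.
    S = sum(1 for column in zip(*seqs) if len(set(column)) > 1)
    if S == 0:
        raise ValueError("no segregating sites; D is undefined")

    # pi: mean number of nucleotide differences over all sequence pairs.
    pairs = list(combinations(seqs, 2))
    pi = sum(sum(a != b for a, b in zip(x, y)) for x, y in pairs) / len(pairs)

    # Constants from Tajima (1989).
    a1 = sum(1.0 / i for i in range(1, n))
    a2 = sum(1.0 / i**2 for i in range(1, n))
    b1 = (n + 1) / (3.0 * (n - 1))
    b2 = 2.0 * (n**2 + n + 3) / (9.0 * n * (n - 1))
    c1 = b1 - 1.0 / a1
    c2 = b2 - (n + 2) / (a1 * n) + a2 / a1**2
    e1 = c1 / a1
    e2 = c2 / (a1**2 + a2)

    # D compares pi with Watterson's estimator S / a1.
    return (pi - S / a1) / sqrt(e1 * S + e2 * S * (S - 1))


# Hypothetical toy data: one dominant haplotype plus four singleton variants.
skewed = ["AAAAA"] * 6 + ["TAAAA", "ATAAA", "AATAA", "AAATA"]
# Hypothetical toy data: two haplotypes at equal frequency.
balanced = ["AAAAA"] * 5 + ["TTTTT"] * 5

print(tajimas_d(skewed))    # negative: variants are all rare
print(tajimas_d(balanced))  # positive: variants at intermediate frequency
```

In the `skewed` alignment every variant is a singleton, so the mean pairwise difference falls below the value expected from the number of segregating sites and D is negative. This is the same qualitative pattern as a major haplotype dominating over rare descendant haplotypes. A negative D on its own does not distinguish purifying or background selection from demographic effects such as population expansion.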
Similar resources
Bias between the left and right inverted repeats during IS911 targeted insertion.
IS911 is a bacterial insertion sequence composed of two consecutive overlapping open reading frames (ORFs [orfA and orfB]) encoding the transposase (OrfAB) as well as a regulatory protein (OrfA). These ORFs are bordered by terminal left and right inverted repeats (IRL and IRR, respectively) with several differences in nucleotide sequence. IS911 transposition is asymmetric: each end is cleaved o...
Efficient transposition of IS911 circles in vitro.
An in vitro system has been developed which supports efficient integration of transposon circles derived from the bacterial insertion sequence IS911. Using relatively pure preparations of IS911-encoded proteins it has been demonstrated that integration into a suitable target required both the transposase, OrfAB, a fusion protein produced by translational frameshifting between two consecutive op...
Evolutionary dynamics of insertion sequences in Helicobacter pylori.
Prokaryotic insertion sequence (IS) elements behave like parasites in terms of their ability to invade and proliferate in microbial gene pools and like symbionts when they coevolve with their bacterial hosts. Here we investigated the evolutionary history of IS605 and IS607 of Helicobacter pylori, a genetically diverse gastric pathogen. These elements contain unrelated transposase genes (orfA) a...
Transposable element ISHp608 of Helicobacter pylori: nonrandom geographic distribution, functional organization, and insertion specificity.
A new member of the IS605 transposable element family, designated ISHp608, was found by subtractive hybridization in Helicobacter pylori. Like the three other insertion sequences (ISs) known in this gastric pathogen, it contains two open reading frames (orfA and orfB), each related to putative transposase genes of simpler (one-gene) elements in other prokaryotes; orfB is also related to the Sal...
Sequence and transcriptional analyses of the fish retroviruses walleye epidermal hyperplasia virus types 1 and 2: evidence for a gene duplication.
Walleye epidermal hyperplasia virus types 1 and 2 (WEHV1 and WEHV2, respectively) are associated with a hyperproliferative skin lesion on walleyes that appears and regresses seasonally. We have determined the complete nucleotide sequences and transcriptional profiles of these viruses. WEHV1 and WEHV2 are large, complex retroviruses of 12,999 and 13,125 bp in length, respectively, that are close...